CDS
Accession Number | TCMCG075C08581 |
gbkey | CDS |
Protein Id | XP_017973251.1 |
Location | join(1698565..1698671,1698752..1698818,1701094..1701293,1701420..1701573,1703994..1704078,1704246..1704337,1705187..1705312,1706169..1706222,1706443..1706511,1707733..1707830,1707935..1708279,1708371..1708458,1710457..1710582,1710667..1710723,1711192..1711271,1711359..1711401,1711806..1711892,1714099..1714188,1714992..1715050,1715126..1715216,1715302..1715467,1715596..1715779,1716790..1717885) |
Gene | LOC18604052 |
GeneID | 18604052 |
Organism | Theobroma cacao |
Protein
Length | 1187 aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018117762.1 |
Definition | PREDICTED: transcription initiation factor TFIID subunit 2 isoform X2 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGAGACAGGGATTCATTTTGAGGATAATGTGATACATACTGATAATCAGATACGACGTGCTCGGTGCTGGTTTCCTTGTATTGATGATAATAATCAACGGTGCTGTTACGATCTGGAGTTCACAGTTGCCCACAATCTTGTGGCTGTCAGCAATGGGAGCTTATTATATCAGGTCTTGAGCAAAGATGACCCTCCTCGCAAGACATATGTCTACAGATTAGATGTTCCTGTCGCTGCTCAGTGGATATCTTTGGCAGTTGGACCATTTGAAATCCTCCCTGATCAGCATAATGGTCTCATTTCGCACATGTGTTTACCACCTAACTTGCCAAAGCTACGTAACACAGTGGAGTTTTTCCATAGTGCATTCAGTGATTATGAGCAGTATCTTGATGCAAAGTTTCCATTTGGGTCATACAAGCAAGTTTTTTTAGCTCCTGAGATGGCAATATCCTCGTCAACTTTTGGGGCCTCCTTGAGCATCCTTAGTTCTCAAGTTTTATTTGATGAGAAAGTTATAGATCAGACAATAGACACTTGCATCAAACTTGCTTTTGCCCTTGCAAGACAGTGGTTTGGGGTATATATTACTCCAGAGGCACCAACTGATGAGTGGCTGCTGGATGGTCTTGCTGGTTTTTTGACAGATTTGTTTATCAAGAAATTTTTGGGAAATAATGAGGCACAATATCGAAGATACAAGGCAAATTGTGCTGTTTGCAAAGCTGATGATAGTGGTGCAACAGCTTTGAGTTCCTCTTTTGCTTGCAAGGATTTGTATGGAACCCATTCCATTGGCTTGAATGGAAAAATACGATCATGGAAGTCTGTGGCAATCCTTCAGGTGTTGGAAAAGCAAATGGGACCTGACTTCTTTAAAAAGATTTTGCAAGCAATAATTTCTCGTGCACAAGGTACAACCTGTCCTGTGAGGTCTCTTAGCACAAAAGAGTTCCGGCATTTTGCTAACAAAATTGGAAATCTGGAGCGTCCATTTCTCAAAGAATTTTTCCCTCGGTGGGTAGGATCACACGGATGTCCAGTGCTCAGGATGGGGTTTTCCTACAACAAGCGGAAAAATATCATTGAGTTGGCAGTTTTGCGGGAATGCACAGCTACTCTAGATTCAAGTGTATCAGTTCCGAATGCTAACCCCGATTCTGAAAACCGCGATGGTGATATTGGATGGCCTGGGGTTATGACTGTCAGGGTTTATGAGCTTGATGGCATGTCTGATCATCCGGATCTTCCAATGTCTGGAGATGCATGGCAGCTACTGGAAATAGCATGTCACTCAAAGCTTGCTGCTAGACGCTACCAGAAGCCTAAAAAGGGTTCAAAACCTGATGGCTCTGATGATAATGGTGATATGCCCAGTTTAGATGTGCGCTCTAGTGTAGACTCTCCATTGTTGTGGATTAGGGCAGATCCAGAGATGGAATACCTTGCTGAAATTCATTTTAATCAACCTGTACAGATGTGGATTAATCAGTTAGAGAAGGATGAAGATGTTGTTGCTCAGGCACAAGCAATTGCAGCATTGGAATCTTTACCAGAGTTCTCACCTTCTGTTGTCAATGCTCTGAATAATTTCCTCACTGATTCTAAGGCCTTTTGGAGAGTTCGAATTGAGGCAGCATTTGCATTGGCTAGTACATCTTCTGAGGAAACTGATTTGGCTGGTTTGCAACATCTGGTGAGATTTTATAAAAGTCGAAGGTTTGATGCAGACATTGGACTCCCCAAACCAAATGACTTCCGTGATTTTCCAGAGTACTTTGTTCTTGAGGCCATTCCACGTGCCATAGCTATGGTAAGAGCTGCAGATAAGAAAAGTCCAAGAGAAGCTGTTGAGTTTGTTCTGCAACTTTTGAAGTATAATGATAATAATGGGAATCCTTACTCAGATGTTTTCTGGCTCGCTGCATTAGTGCAGTCAGTTGGTGAACTTGAATTTGGGCAACAGAGTATTTTCCTTTTATCTTCTCTTCTC
AAGCGCATCGATCGGCTTTTGCAATTTGACAGGTTGATGCCTAGTTACAATGGGATTTTGACAATCAGCTGCATCCGAACCTTGGCACAAATTGCATTAAAGCTTTCTGGATTCATCCACCTAGATCATGTCTGTGAACTGATTAAACCATTTCGAGATTTCAAGACAATCTGGCAAGTACGAATAGAAGCAAGCAGAGCACTCCTTGATCTTGAGTTTAACTGCAATGGCATCAATGCAGCATTGTTGTTGTTTATTAAATATATAGAGGAAGAGCCTTCTTTAAGAGGGCAGGTAAAGTTGGGTGTGCATGCTATGCGGTTATGTCAGATACGAGGTGGATCAGTTTCTAATGAGGATATTAAGTCGACCACTCTTGTGGCTTTGCTTCAGCTTTTAGAGAGCCGCACAGCATTCAATAATGTATCTCTCCGGCACTACATGTTCTCCATTCTTCAAGTCCTTGCAGGAAGAACCCCCACACTTTATGGAGTGCCTAAAGATAAGGTACGGCGAATGGCTGATGTGGAGGTTTGCAATGAGCAGAAGAACCATTTTGCAGCTCCTGTTGCAGAGATAAAGCCTGCTGAACCTCCCGCGGCGAACCCGAACCTTTTGCATGATAATCTGGCCATTCCAGAAGCTTCCAAGGGAGTGGATACTGTTTCCAACAGTCATGAGAGGAAGACATCCGTTGTTAAGATTAGGGTCAAGCAGTCTGGGACAACCAGTAAAGCAGAGGAAGGTGACGATGCTACCGTCGAAAGATCTCAAGGAAGGCATCCTGATGCCGATCGCGGCGCCACCAGTTCGGTTTCAGTGGATGCACCCCAAAGAAATTCAGCTGAGGCTGTGAGCATTAGCAATCAAAATATCGAAGAAGTCAACTCATTTCATGATCACGGGTCTCGGATCACCGCTAGCATTGGGAGTGCAAAAATTGCAAGTGAAGGTGACAACTTTGGTAAGGAACTTCAGTGTACTGCCGATTCAAGTAATGTTGCCGCGTGTCCTAGGCCCGATAATCCGTCATCACCTAGCATCATCCAAGATAACTACATAGATGCTGAAGGACAAAAGTTCGCAAGTCTTCAAACCCTATCAGTTTCAAGACAGGATGGTGGTTCATTGGGCACTGTGGATTCTCCAAACCGCGGCAAGGAGAAGAAGAAGAAGAAGAAGGACAAGGAAAAGAAAAAAGATAAGGAAAAGAAACGAAAGCGAGAAGACCACAAAGGACACCGAGACGATCTCGAGTATTTAGAGAAGAAGCGATTGAAAAAGGAGAGAAAACACAAGGAAAAGGAGATGGCAAAGCTGCTGAGTGAAGCCAAGACGACTTCAACAACAGAATTACGAGGTAAGAAAGAGGAAACGACATCTTTAACAAAAGAGTTGCCTGGTAAGAAAGAGGAACTGGTTGCGAAGTCAGCAACGGTGCCATTGAAACCAAGTGCACCCCCCAAGGTAGTGATAACAAAGTCGGAAACCAGGACGGAGCCAACAGAAGGTACTTCAGCTCCCAAATTCCGAATAAAAATAAAGAACAAGTCACTGAATAAATCATAG |
Protein: METGIHFEDNVIHTDNQIRRARCWFPCIDDNNQRCCYDLEFTVAHNLVAVSNGSLLYQVLSKDDPPRKTYVYRLDVPVAAQWISLAVGPFEILPDQHNGLISHMCLPPNLPKLRNTVEFFHSAFSDYEQYLDAKFPFGSYKQVFLAPEMAISSSTFGASLSILSSQVLFDEKVIDQTIDTCIKLAFALARQWFGVYITPEAPTDEWLLDGLAGFLTDLFIKKFLGNNEAQYRRYKANCAVCKADDSGATALSSSFACKDLYGTHSIGLNGKIRSWKSVAILQVLEKQMGPDFFKKILQAIISRAQGTTCPVRSLSTKEFRHFANKIGNLERPFLKEFFPRWVGSHGCPVLRMGFSYNKRKNIIELAVLRECTATLDSSVSVPNANPDSENRDGDIGWPGVMTVRVYELDGMSDHPDLPMSGDAWQLLEIACHSKLAARRYQKPKKGSKPDGSDDNGDMPSLDVRSSVDSPLLWIRADPEMEYLAEIHFNQPVQMWINQLEKDEDVVAQAQAIAALESLPEFSPSVVNALNNFLTDSKAFWRVRIEAAFALASTSSEETDLAGLQHLVRFYKSRRFDADIGLPKPNDFRDFPEYFVLEAIPRAIAMVRAADKKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSIFLLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLAQIALKLSGFIHLDHVCELIKPFRDFKTIWQVRIEASRALLDLEFNCNGINAALLLFIKYIEEEPSLRGQVKLGVHAMRLCQIRGGSVSNEDIKSTTLVALLQLLESRTAFNNVSLRHYMFSILQVLAGRTPTLYGVPKDKVRRMADVEVCNEQKNHFAAPVAEIKPAEPPAANPNLLHDNLAIPEASKGVDTVSNSHERKTSVVKIRVKQSGTTSKAEEGDDATVERSQGRHPDADRGATSSVSVDAPQRNSAEAVSISNQNIEEVNSFHDHGSRITASIGSAKIASEGDNFGKELQCTADSSNVAACPRPDNPSSPSIIQDNYIDAEGQKFASLQTLSVSRQDGGSLGTVDSPNRGKEKKKKKKDKEKKKDKEKKRKREDHKGHRDDLEYLEKKRLKKERKHKEKEMAKLLSEAKTTSTTELRGKKEETTSLTKELPGKKEELVAKSATVPLKPSAPPKVVITKSETRTEPTEGTSAPKFRIKIKNKSLNKS |
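The Location field uses GenBank-style `join(start..end,...)` notation with 1-based, inclusive coordinates. As a consistency check, the summed segment lengths can be compared against the stated protein length. The following is a minimal sketch that assumes a forward-strand location with no `complement(...)` wrapper and no partial-end markers (`<`, `>`). The helper names `parse_join` and `cds_length` are hypothetical and are not part of any database tooling.

```python
import re

# Location string copied from the CDS record above.
LOCATION = (
    "join(1698565..1698671,1698752..1698818,1701094..1701293,"
    "1701420..1701573,1703994..1704078,1704246..1704337,"
    "1705187..1705312,1706169..1706222,1706443..1706511,"
    "1707733..1707830,1707935..1708279,1708371..1708458,"
    "1710457..1710582,1710667..1710723,1711192..1711271,"
    "1711359..1711401,1711806..1711892,1714099..1714188,"
    "1714992..1715050,1715126..1715216,1715302..1715467,"
    "1715596..1715779,1716790..1717885)"
)


def parse_join(location):
    """Return (start, end) integer pairs for each start..end segment."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(location):
    """Sum segment lengths; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in parse_join(location))


segments = parse_join(LOCATION)
total_nt = cds_length(LOCATION)
codons = total_nt // 3  # includes the terminal stop codon

print(len(segments), total_nt, codons - 1)
```

For this record the segments sum to 3564 nt, which is 1188 codons: 1187 amino acids plus the stop codon, in agreement with the Length field.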